Query Optimization for Semistructured Data Using Path Constraints in a Deterministic Data Model

نویسندگان

  • Peter Buneman
  • Wenfei Fan
  • Scott Weinstein
چکیده

Path constraints have been studied in [4, 11, 12, 13] for semistructured data modeled as a rooted edge-labeled directed graph. They have proven useful in the optimization of path queries. However, in this graph model, the implication problems associated with many natural path constraints are undecidable [11, 13]. A variant of the graph model, called the deterministic data model , was recently proposed in [10]. In this model, data is represented as a graph with deterministic edge relations, i.e., the edges emanating from any node in the graph have distinct labels. The deterministic graph model is more appropriate for representing, for example, ACeDB [27] databases and Web sites. This paper investigates path constraints for the deterministic data model. It demonstrates the application of path constraints to, among other things, query optimization. Three classes of path constraints are considered: the language Pc introduced in [11], an extension of Pc, denoted by P w c , by including wildcards in path expressions, and a generalization of P c , denoted by P c , by representing paths as regular expressions. The implication problems for these constraint languages are studied in the context of the deterministic data model. It shows that in contrast to the undecidability result of [11], the implication and nite implication problems for Pc are decidable in cubic-time and are nitely axiomatizable. Moreover, the implication problems are decidable for P c . However, the implication problems for P c are undecidable.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Query Optimization for Semistructured DataUsing Path

Path constraints have been studied for semistructured data modeled as a rooted edge-labeled directed graph 4, 11{13]. In this model, the implication problems associated with many natural path constraints are undecidable 11, 13]. A variant of the graph model, called the deter-ministic data model, was recently proposed in 10]. In this model, data is represented as a graph with deterministic edge ...

متن کامل

Path Constraints for Databases With or Without Schemas

This dissertation introduces a path constraint language and investigates its associated implication and finite implication problems. This path constraint language has proven useful in a variety of database contexts, ranging from semistructured data as found for instance on the Web, to structured data such as data in object-oriented databases. It is capable of expressing natural integrity constr...

متن کامل

Path constraints in semistructured data

We consider semistructured data as multi rooted edge-labeled directed graphs, and path inclusion constraints on these graphs. A path inclusion constraint p q is satisfied by a semistructured data if any node reached by the regular query p is also reached by the regular query q. In this paper, two problems are mainly studied: the implication problem and the problem of the existence of a finite e...

متن کامل

Modeling and Querying Web Data: A Constraint-Based Logic Approach

The efficient and sophisticated representation of the structure of the documents being circulated over the Internet allows for effective querying and reasoning over them. This is a major goal for large information resources like the World Wide Web (WWW). Constraints are a valuable tool for managing information. In this work, we consider how constraintbased technology can be used to query and re...

متن کامل

Path Constraints in Semistructured Databases

General rights Copyright for the publications made accessible via the Edinburgh Research Explorer is retained by the author(s) and / or other copyright owners and it is a condition of accessing these publications that users recognise and abide by the legal requirements associated with these rights. Take down policy The University of Edinburgh has made every reasonable effort to ensure that Edin...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1999